Limits to the Performance of Software Shared Memory: A Layered Approach
Authors
Abstract
Much research has been done in fast communication on clusters and in protocols for supporting software shared memory across them. However, the end performance of applications that were written for the more proven hardware-coherent shared memory is still not very good on these systems. Three major layers of software (and hardware) stand between the end user and parallel performance, each with its own functionality and performance characteristics. They include the communication layer, the software protocol layer that supports the programming model, and the application layer. These layers provide a useful framework to identify the key remaining limitations and bottlenecks in software shared memory systems, as well as the areas where optimization efforts might yield the greatest performance improvements. This paper performs such an integrated study, using this layered framework, for two types of software distributed shared memory systems: page-based shared virtual memory (SVM) and fine-grained software systems (FG). For the two system layers (communication and protocol), we focus on the performance costs of basic operations in the layers rather than on their functionalities. This is possible because their functionalities are now fairly mature. The less mature applications layer is treated through application restructuring. We examine the layers individually and in combination, understanding their implications for the two types of protocols and exposing the synergies among layers.
Similar resources
Collective Memory as a Measure to Evaluate the Infill Architecture Innovations in Historic Contexts (Case Study: Historic Context of Imamzadeh Yahya in Tehran)
Historic contexts remind us of an era when cities were built based on the needs, goals, and preferences of their inhabitants. In other words, the mental world of both the builders and the inhabitants was closely interrelated. But by ignoring citizens' memories and interests and their mental needs, today's interventions with rapid developments within historic contexts have led to amnesia and the...
Remote Shared Memory over Sun Fire Link Interconnect
The interconnection networks and the communication system software in clusters of multiprocessors are critical to achieving high performance. Recently, Sun Microsystems has introduced a new system area network, Sun Fire Link interconnect, for its Sun Fire cluster systems with some performance results at the MPI level. Sun Fire Link is a memory-based interconnect with layered system software com...
A Finite Volume Formulation for the Elasto-Plastic Analysis of Rectangular Mindlin-Reissner Plates, a Non-Layered Approach
This paper extends the authors' previous work and presents a non-layered Finite Volume formulation for the elasto-plastic analysis of Mindlin-Reissner plates. The incremental algorithm of the elasto-plastic solution procedure is shown in detail. The performance of the formulation is examined by analyzing plates with different boundary conditions and loading types. The results are illustrat...
Orca: a Portable User-Level Shared Object System
Orca is an object-based distributed shared memory system that is designed for writing portable and efficient parallel programs. Orca hides the communication substrate from the programmer by providing an abstract communication model based on shared objects. Mutual exclusion and condition synchronization are cleanly integrated in the model. Orca has been implemented using a layered system, consis...
Modeling and Performance Evaluation of Multi-Processors Organization with Shared Memories
This paper is primarily concerned with the theoretical evaluation of the performance of multiprocessor systems. A Markovian waiting-line model has been developed for various multiprocessor configurations with shared memory. The system is analysed at the request level rather than the job level.